Problems and challenges of information resources producers' clustering

Authors

  • Anna Cena
  • Marek Gagolewski
  • Radko Mesiar
Abstract

Classically, unsupervised machine learning techniques are applied to data sets with a fixed number of attributes (variables). However, many problems encountered in informetrics require extending such methods so that they can be computed over sets of nonincreasingly ordered vectors of unequal lengths. Thus, in this paper, some new dissimilarity measures (metrics) are introduced and studied. With these, we may use, among others, hierarchical clustering algorithms to determine a partition of an input data set consisting of sets of producers that are homogeneous not only with respect to the quality of information resources, but also their quantity.
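
The idea of clustering ordered vectors of unequal lengths can be illustrated with a small sketch. The dissimilarity below is an assumption made purely for illustration and is not one of the metrics introduced in the paper, which this abstract does not state: the shorter vector is padded with zeros, and a hypothetical `penalty` parameter weights the difference in lengths, so that both quality (the values) and quantity (the number of items) affect the result. The clustering step is a plain single-linkage agglomerative procedure, and the sample `producers` data are made up.

```python
from itertools import zip_longest
from math import sqrt


def dissimilarity(x, y, penalty=1.0):
    """Illustrative dissimilarity between two vectors of unequal length.

    Assumptions (not taken from the paper): the shorter vector is padded
    with zeros, and `penalty` weights the difference in lengths.
    """
    # Quality part: squared differences after zero-padding the shorter vector.
    quality = sum((a - b) ** 2 for a, b in zip_longest(x, y, fillvalue=0.0))
    # Quantity part: penalise a difference in the number of items.
    quantity = penalty * abs(len(x) - len(y))
    return sqrt(quality + quantity)


def single_linkage(vectors, k, penalty=1.0):
    """Merge the two closest clusters repeatedly until k clusters remain."""
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = min(
                    dissimilarity(vectors[i], vectors[j], penalty)
                    for i in clusters[a]
                    for j in clusters[b]
                )
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        # b > a, so removing cluster b does not shift the index of cluster a.
        clusters[a].extend(clusters.pop(b))
    return [sorted(c) for c in clusters]


# Made-up producers: each vector holds item scores sorted nonincreasingly.
producers = [[10, 7, 3], [9, 8, 2, 1], [2, 1], [1]]
print(single_linkage(producers, k=2))  # prints [[0, 1], [2, 3]]
```

In this toy data set the two high-scoring producers end up in one cluster and the two low-scoring ones in the other. Raising `penalty` makes the number of items count for more relative to the scores.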

Similar resources

Children's Interactive Book in Iran A Review On Existing Situation And Production Challenges

Background and objective: Audience communication with digital media is bilateral and interactive. Book publishing in non-printed and digital formats has also created the feature of interactivity in books. This paper presents an attempt to identify the challenges of creating interactive children's books in Iran by evaluating children's interactive books published in Iran and the ...

Model the allocation of productive financial resources from the perspective of livelihood poverty indicators using a combination of clustering methods and SAW technique

Poverty is a social, economic, cultural and political reality that has long been one of the greatest human problems. The diversity of problems and needs of the deprived and low-income groups of society and the multiplicity of poverty indicators on the one hand, and on the other hand the lack of financial resources and credits to solve the poverty indicators, organizations in charg...

Improving the Accuracy of Author Name Disambiguation Using Agglomerative Clustering

Today, digital libraries are important academic resources, including millions of citations and essential bibliographic information such as titles, authors' names and locations of publications. From the viewpoint of knowledge accumulation management, the ability to search quickly and accurately for desired content is of great importance. The complexity and similarity in these resources cause many challenges and...

Population Growth and the Interaction of Urban Environmental Challenges, Case Study: Zahedan

In recent decades, urban management in Iran has increasingly confronted numerous challenges due to different kinds of social, cultural, political, executive, financial, and legal factors. The present study specifies and analyzes these challenges in the domain of the urban environment in order to determine how far they have been effective, how they have been priorit...

Optimal Feature Selection for Data Classification and Clustering: Techniques and Guidelines

In this paper, principles and existing feature selection methods for classifying and clustering data are introduced. To that end, categorizing frameworks for finding selected subsets, namely, search-based and non-search based procedures as well as evaluation criteria and data mining tasks are discussed. In the following, a platform is developed as an intermediate step toward developing an intell...

Journal:
  • J. Informetrics

Volume 9

Publication date: 2015